Multi-Model and Crosslingual Dependency Analysis
Abstract
This paper describes the system of the team Orange-Deskiñ, used for the CoNLL 2017 UD Shared Task. We based our approach on an existing open-source tool (BistParser), which we modified to produce the required output. We also added a form of pseudo-projectivisation, which was needed because some of the task's languages have a high percentage of non-projective dependency trees. In most cases we also employed word embeddings. For the four surprise languages, the data provided seemed too small to train on, so we used the training data of typologically close languages instead. Our system achieved a macro-averaged LAS of 68.61% (10th in the overall ranking), which improved to 69.38% after bug fixes.
Similar resources
Parsing Natural Language Sentences by Semi-supervised Methods
We present our work on semi-supervised parsing of natural language sentences, focusing on multi-source crosslingual transfer of delexicalized dependency parsers. We first evaluate the influence of treebank annotation styles on parsing performance, focusing on adposition attachment style. Then, we present KLcpos3, an empirical language similarity measure, designed and tuned for source parser we...
Exploring Cross-Lingual Transfer of Morphological Knowledge In Sequence-to-Sequence Models
Multi-task training is an effective method to mitigate the data sparsity problem. It has recently been applied to crosslingual transfer learning for paradigm completion (the task of producing inflected forms of lemmata) with sequence-to-sequence networks. However, it remains unclear how the model transfers knowledge across languages, and whether and which information is shared. To investigate th...
Power System Analysis for Nonsinusoidal Steady State Studies Based on Wavelets
In this paper, the power system model is represented in a new domain related to the Multi-Resolution Analysis (MRA) space. By developing mathematical models of the elements in this space using the Galerkin method, a new alternative method for power system simulation under nonsinusoidal and periodic conditions is developed. The mathematical formulation and characteristics of the newly proposed space are presented. Als...
Inverted indexing for cross-lingual NLP
We present a novel, count-based approach to obtaining inter-lingual word representations based on inverted indexing of Wikipedia. We present experiments applying these representations to 17 datasets in document classification, POS tagging, dependency parsing, and word alignment. Our approach has the advantage that it is simple, computationally efficient and almost parameter-free, and, more impo...
Cross-Lingual Syntactically Informed Distributed Word Representations
We develop a novel cross-lingual word representation model which injects syntactic information through dependency-based contexts into a shared cross-lingual word vector space. The model, termed CLDEPEMB, is based on the following assumptions: (1) dependency relations are largely language-independent, at least for related languages and prominent dependency links such as direct objects, as evidenc...
Publication date: 2017